Accelerating Stochastic Simulations on GPUs Using OpenCL
نویسندگان
چکیده
منابع مشابه
Accelerating Molecular Dynamics Simulations with GPUs
Molecular dynamics simulations are known to run for many days or weeks before completion. In this paper we explore the use of GPUs to accelerate a LennardJones-based molecular dynamics simulation of up to 27000 atoms. We demonstrate speedups that exceed 100x on commodity Nvidia GPUs and discuss the strategies that allow for such exceptional speedups. We show that traditional molecular dynamics ...
متن کاملAccelerating high-order WENO schemes using two heterogeneous GPUs
A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...
متن کاملAccelerating Double Precision FEM Simulations with GPUs
In visualization and computer graphics it has been shown that the numerical solution of PDE problems can be obtained much faster on graphics processors (GPUs) than on CPUs. However, GPUs are restricted to single precision floating point arithmetics which is insufficient for most technical scientific computations. Since we do not expect double precision support natively in graphics hardware in t...
متن کاملAccelerating QDP++ using GPUs
Graphic Processing Units (GPUs) are getting increasingly important as target architectures in scientific High Performance Computing (HPC). NVIDIA established CUDA as a parallel computing architecture controlling and making use of the compute power of their GPUs. CUDA provides sufficient support for C++ language elements to enable the Expression Template (ET) technique in the device memory domai...
متن کاملAccelerating Radiosity on GPUs
We propose a novel approach to implement radiosity on GPU with specific optimizations via form-factor matrix transformations. The proposed transformations enable to reduce the amount of computations for multiple-bounce global illumination and apply DXT compression (with subsequent hardware decompression when reading formfactors on GPU). Our implementation is 10 times faster running and requires...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Transactions on Information and Systems
سال: 2019
ISSN: 0916-8532,1745-1361
DOI: 10.1587/transinf.2019edl8030